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Abstract: The application of Statistical Physics to social systems is mainly 
related to the search for macroscopic laws, that can be derived from exper- 
imental data averaged in time or space, assuming the system in a steady 
state. One of the major goals would be to find a connection between the 
statistical laws to the microscopic properties; for example to understand 
the nature of the microscopic interactions or to point out the existence 
of interaction networks. The probability theory suggests the existence of 
few classes of stationary distributions in the thermodynamics limit, so that 
the question is if a statistical physics approach could be able to enroll 
the complex nature of the social systems. We have analyzed a large GPS 
data base for single vehicle mobility in the Florence urban area, obtaining 
statistical laws for path lengths, for activity downtimes and for activity 
degrees. We show also that simple generic assumptions on the microscopic 
behavior could explain the existence of stationary macroscopic laws, with 
an universal function describing the distribution. Our conclusion is that 
understanding the system complexity requires dynamical data-base for the 
microscopic evolution, that allow to solve both small space and time scales 
in order to study the transients. 

Keywords and phrases: Urban Mobility, GPS data, Probability Distri- 
butions, Statistical Physics. 



1. Introduction 

Any statistical analysis of real systems is based on the Ergodic Principle for the 
microscopic dynamics, that implies the relaxation towards steady states and the 
independence property of elementary components. Even if the existence of mi- 
croscopic interactions is necessary for the system to evolve toward a statistical 
equilibrium, in this state any particle moves independently from the others and 
all the particles are statistically equivalent (any particle may be representative 
for the whole) . The thermodynamics laws that are derived from a statistical me- 
chanics approach, concern some macroscopic observables of the system, evolving 
adiabatically with respect the microscopic relaxation time (i.e. we can consider 
the whole system in a almost equilibrium state), so that the effects of single par- 
ticle dynamics arc conveniently described by means of stochastic processes. As 
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a consequence, there should exist a natural separation among macroscopic and 
microscopic space-time scales. Indeed space and time scales are expected to be 
strictly correlated: to understand small scale phenomena we need to solve short 
time scales and viceversa. Nevertheless the statistical mechanics has a great 
success in describing evolution of macroscopic systems and there is a strong 
effort to generalize the results for a non-equilibrium thermodynamics and for 
application to complex systems[l]. The statistical properties of social systems 
have been recently considered under a different point of view due to the possi- 
bility of recording large microscopic data sets [2, 3]. The main problem is what 
are the macroscopic effects of cognitive behavior for " social particles" . Indeed 
the cognitive behavior would imply the existence of strong bidirectional inter- 
actions among the dynamics at different space and time scales of the system [4]. 
Emergence and self-organization characterize the macroscopic states, but the 
question is which macroscopic observables (if they exist) may enrol the complex 
nature of the system. These variables may also play an important role in the 
study of phase transitions and in the control parameters definition. 
In Italy GPS data on individual vehicle paths are currently recorded for insur- 
ance reasons over a sample ~ 2% of the whole private vehicle population [5, 6]. 
This data set gives the opportunity to study the individual mobility demand in 
urban contexts. The GPS data set contains the geographical coordinates, the 
time, the instantaneous velocity and the path length of individual trajectories 
at positions whose relative distance is of order 1-^2 km. Special signals are 
recorded when the engine is switched on and off. We remark that the data refer 
mainly to the private transportation mobility and that, due to privacy legal 
problems, we do not have any knowledge on the social features of individuals in 
the sample. 

In this paper we analyze the statistical distributions of the path lengths of indi- 
vidual trajectories, of the activity downtime and the distribution of the monthly 
activity degree. Our aim is to point out the main macroscopic features of ur- 
ban mobility studying their correlation with the idea of " asystematic mobility" 
recently proposed by sociologists to explain the observational data in modern 
metropolis. We consider GPS data recorded during March 2008 in the Florence 
urban area. We show that some simple assumptions on single particles, like 
the existence of an " individual mobility energy" and of an " individual mobility 
time" that define the daily agenda, may explain the statistical laws emerging 
from the GPS data. Moreover in the equilibrium state individuals seem to mini- 
mize their interactions, behaving independently, so that the Maximum Entropy 
Principle of statistical mechanics can be applied [7]. These results are consistent 
with the idea that the sprawling phenomenon of modern cities implies that citi- 
zens move as stochastic particles [8] . Finally our analysis enlightens some average 
cognitive properties of individuals in urban mobility. The paper is organized as 
follows: in the first section we shortly described the GPS data base for vehicle 
mobility; in the remaining sections we discuss the three statistical laws on path 
lengths distribution, on the activities downtime and on the activity degree that 
are inferred from the data. 
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2. GPS data for vehicle mobility 

A sample of ~ 2% of the private vehicles in Italy has a GPS system for insurance 
reason. Any vehicle is associated to an ID number, so that it is possible to 
follow its mobility during a long time. Each datum gives position, velocity, 
covered distance from the previous measure and quality of signal. The data give 
a sampling of individual trajectories each ~ 2 km, but a signal is also recorded 
any time the engine is switched on or off. The data suffer from the GPS limited 
precision, in particular when the GPS looses the satellite signal. These problems 
are especially relevant at starting points of the trajectories or when vehicles 
are parked inside a building, and short paths could be strongly affected by 
these pathologies. When the quality of signal is good the time precision of the 
recorded data is practically perfect, whereas the space precision is of the order of 
10 m, usually sufficient to localize a vehicle on the road. Both the instantaneous 
velocity and the covered space, are given with an adequate precision since they 
result from a calculation based on GPS data recorded each second, but not 
registered. We have developed several methods to clean the data from spurious 
effects in order to avoid passible bias in the sample. 

In the present work we consider the GPS data in the Florence urban area 
recorded during March 2008: these data are related to 35,000 vehicles in a circu- 
lar area of radius ~ 30 Km, around the historical center and defining ~ 2.5 x 10^ 
different trajectories. Wc have restricted our analysis to the trajectories which 
start inside a circle of 10 Km around the historical town and remain inside the 
considered area, so that with a good probability we select people living and 
moving in Florence. Then we look for the vehicles which perform daily loops 
from starting points that we identify as "home": in this case the number of tra- 
jectories reduces to ~ 4. x 10^. In figure 1 we show the considered area where we 
have plotted the aggregate position GPS data: the color refers to the different 
instantaneous velocities (red means less than 30 Km/h, whereas yellow refers to 
a velocity in the interval 30 60 Km/h and the green to a velocity > 60 km/h). 
We are completely ignorant on the social composition of the sample and on the 
specific drivers, but we expect that such individuals perform a mobility related 
to the activities present in the Florence area and have a certain knowledge of 
the road network. 

3. Path length distribution 

The activity sprawling that characterizes the modern metropolis has certainly 
a strong influence on the individual mobility demand [9]. Even if the Florence 
historical center is a very special area full of artistic and tourist attractions, 
but forbidden to private traffic, nevertheless we assume the activities randomly 
distributed in the urban system. This hypothesis is quite reasonable because we 
consider a large urban area and our sample is surely composed by inhabitants 
and not tourists. As a consequence, we expect that the citizen mobility agenda 
are influenced by individual features rather than by the city structure. In cities 
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Fig 1 . Aggregated GPS position data in the Florence area recorded during March 2008; the 
red dots corresponds to a recorded instantaneous velocity < 30 Km/h, whereas the yellow dots 
correspond to velocities in the interval 30 60 Km/h and the green dots to a velocity > 60 
km/h. 

the stationary average traffic state should emerge as the result of individual in- 
teractions of cognitive particles which share the same spatial resources. In partic- 
ular we assume that drivers organize their mobility, by applying a minimization 
strategy of the interactions with other individuals [10]. In this conceptual frame- 
work, the mobility can be seen as the realization of many independent individual 
agenda and the dynamical properties become similar to that of a Boltzmann 
gas. Even if it is obviously true that individuals are non-identical particles, path 
length and activity downtime can be considered common mobility features to 
all people, and good candidates for a statistical physics approach to describe 
the stationary state. For each vehicle we have recorded the total lengths L of 
his daily round trips for the whole considered period. The length distribution 
is plotted in fig. 2 where we point out the existence of an interpolation with a 
Maxwell-Boltzmann distribution 

p(L) = Loexp(-L/Lo) (1) 

with Lo ~ 25 Km the characteristic daily path length. The distribution (1) 
provides a very good fit of the experimental data, and it can be justified by 
the Maximum Entropy Principle under the assumption that the individuals 
are independent particles and that there exists an average daily trip length in 
the population (see supplementary material). In such a case the distribution 
(1) is realized when any particle chooses its mobility energy randomly. It is 
straightforward to associate an individual "mobility energy" to the daily path 
lengths. The assumption that citizens organize their mobility as they own an 
internal "mobility energy", agrees with similar hypotheses discussed by R.Kolb 
and D.Helbing[ll] to explain the daily travel-time distributions for different 
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Fig 2. Empirical distribution of the daily mobility length L(dots). The continuous line is an 
interpolation with a Maxwell-Boltzmann distribution with average value < L >~ 25 Km. 



transport modes. 

In order to investigate the concept of the mobility energy together with the 
Maximum Entropy Principle, we consider the relation between the daily path 
length and the single trip path length, building up the rank distribution of indi- 
vidual daily activities. To define an activity from GPS mobility data, we apply a 
clustering procedure to vehicle stop positions, identifying the positions which lie 
in a circle of diameter ~ 500 m (this is considered a acceptable distance between 
the true destination and the parking place [12]). Moreover we have associated an 
activity when the elapsed time before the next trip is greater than 15 minutes. 
In figure 3 we plot the rank distribution for the daily activities together with 
an exponential interpolation 

p{k) cx a'' (2) 

that provides a very good fit of experimental observations. The equation (2) 
is consistent with the assumption that, in average, the individuals behave as 
independent random particles which define their daily agenda in a random way. 
According to previous hypotheses, it is possible to compute in analytical way the 
single trip length distribution, as the distribution realized by uniformly spread- 
ing k points into a given segment of length L. A simple calculation provides the 
single trip length distribution in the form (see appendix) 

TV 
k=l 

where c is a normalizing factor and N is the maximum number of daily activities; 
we remark that the choice of the points in the segment is contextual without any 
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Fig 3. Rank distribution of daily activities computed from the GPS data( circles). The con- 
tinuous line is an exponential interpolation according to (2) with a ~ .7. 



time-ordering. It is quite natural to assume that there should exist a correlation 
between the number of daily activities N and the daily mobility length L, but 
the GPS data do not suggest any correlation function, so that we decided to use 
an effective daily length in the theoretical distribution (3) to make a comparison 
with the empirical one. It turns out that If we exclude the very short paths, the 
curve (3) fits very well with the experimental data (see fig. 4) with an effective 
daily mobility length L — Km (the result is not too sensitive to this particular 
value and we have not performed any optimization procedure) . As a final remark 
from the figure 4, we observe that the long trip length distribution differs from 
an exponential behavior and it rather seems to follow a power law. 



4. Activity downtime distribution 

Time is the second fundamental individual variable of human mobility, directly 
related to the dynamical realization of daily agenda. From the GPS data base 
in the Florence area we have computed the downtime spent in each daily ac- 
tivity by a fixed individual; we discard from the activities the sleeping time 
linked to circadian rhythms. The distribution of the activity downtimes of the 
recorded individuals is plotted in fig. 5, recovering the well known Benford's 
law[13]. In order to give a microscopical interpretation of the empirical down- 
time distribution, we assume that individuals cannot determine a priori each 
activity downtime, because this is varied depending on unpredictable circum- 
stances. According to this hypothesis, in the average, each particle has a finite 
mobility time at disposal to perform the desired activities, and he consumes the 
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Fig 4. Single trip length distribution computed from the GPS data (circles). The continuous 
line corresponds to an interpolation with the distribution (3) by fixing the maximal number 
of daily activities at N = 18 according to fig, 3 and an effective daily mobility length L = 30 
Km (not optimized). 




Fig 5. Activity downtime distribution computed from the GPS data (circles); we have normal- 
ized the data at the maximal value. The continuous line corresponds to an interpolation with 
a power law x" with a = .97 which is consistent with the Benford's law (4). 
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time in successive random choices up to the end of the whole mobihty time. If 
one computes the interval distribution that is obtained by choosing successively 
k points in a given segment, one get analytically the Benford's distribution [13] 



N 



Pit) cx - ^ 



k=l 



(lnt)(A:- 1) 



(4) 



Shortly, the statistical results of the monthly mobility in the Florence area, 
recorded by the GPS data on vehicles, suggest that the macroscopic average 
properties are the same of those of Boltzmann's particles moving in a homoge- 
neous space with an average energy and a finite time at disposal. The energy 
introduces a global constraint in the individual daily mobility, whereas the time 
can be seen as a local constraint in the activity planning since it is consumed 
step by step. 

However computing the distribution of the total activity downtime for the mon- 
itored vehicles, we can get an idea of the typology of urban mobility described 
by our sample. The results are plotted in fig. 6 We remark the presence of two 
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Fig 6. Daily activity downtime distribution. We remark three characters of the underlying 
mobility: the first peak is related to short downtime activities f~ 1 h), the second peak (~ 
10 h) is probably due to working people and the long tail (> 12 h) could be explained by the 
presence of business vehicles that are used by different people. 



peaks: one is related to short downtime activities, that probably corresponds to 
a specific use of the vehicle for a single trip, whereas the second peak centered 
at ~ 10 h denotes people performing a more complex mobility agenda, which 
contains the working activities. Finally the presence of a long queue for the daily 
activity downtime (> 12 h) is probably due to business vehicles in our sample, 
that are used by different people. 
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5. Activity degree distribution 

Both the Boltzmann distribution for the mobihty energy and the Benford's law 
for the activity downtime enroll stochastic features of the system, but they do 
not explain how such features can be related to the individual daily agenda, that 
are certainly the result of a cognitive behavior. In order to study this question, 
we perform a statistical analysis of the downtime related to daily activities, 
considering the monthly degree k for the different individual activities (i.e. the 
number of times that a citizen repeats a certain activity during a month) [14]. 
Let t the activity downtime, we introduce the join probability p{t, k) to denote 
the probability of finding a A:-degree activity associated to a downtime t. Then 
by definition we have to recover the Benford's law (4) by summing over k 

Y^p{t,k)^^ (5) 
k ^ 

We have also the equality 

p(t,k) ^p{t I k)kp{k) 

where p{t \ k) is the conditional probability for a downtime t considering only 
the fc-degree activities, and p{k) is the probability to detect a k degree activity; 
the factor k takes into account the multiplicity of the k degree activities. The 
study of the conditional probability p{t \ k) can shed some light to understand 
the mobility habits related to the use of private vehicles and to face the question 
of the relevance of repeated activities both in the mobility and in the use of time. 
Remarkably the experimental observation suggest the existence of an universal 
probability distribution f{u) for the normalized downtime t/ < t >k'. 

= (6) 

<t>k 

where < t >k is the average downtime for the /c-degree activities. We read this 
universal function as the signature of the fact that individuals organize their 
time, when performing a private car mobility, in a common way independently 
from the specific activity, i.e. the relative downtime fluctuations are the result of 
a stochastic universal mechanism. Moreover there should exist a common feature 
among the individuals, concerning how they manage the downtime related to 
the /c-degree activities, since only the average value < t >fc characterizes the 
k dependence of the conditional probability p{t \ k). This universal character 
could be explained thinking that the t/ < t >k variable is a generic "measure" 
of the mobility actions, valid for every individual. More precisely t/ < t >k can 
be considered the temporal norm for all the mobility related activities. From the 
empirical data we detect ~ 3 x 10^ activity downtimes and we have computed 
the dependence of the average value < t >k using the degree k = 3,. .,20. It 
is evident from fig. 7 that we have an almost linearly increasing behavior of 
< t >fc as the degree k increases. This means the existence of a relation between 
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Fig 7. Dependence of the average downtime < t >k from the activities degree k. The contin- 
uous line refer to a possible interpolation with the exponential function (7). 

the activity degree and the activity "use value" (individual satisfaction, profit, 
etc..) that introduces an individual tendency to repeat and to spend time in 
the activities with a relevant added value [15]. A possible local interpolation of 
the empirical data is obtained by using the function (continuous line in fig. 7) 

< t >fccx exp(7fc'') (7) 

where a ~ .3 and 7 ~ .7. In figures 8,9 and 10 we plot the empirical probability 
densities for different degree (from fc = 3 to fc = 20) to investigate the existence 
of the universal distribution f{u). There is a decreasing of the data number as 
k increases, but all the distributions are computed with a sample of the same 
order (from 4 x lO'' to 10**). The figures enlighten three different features. There 
is a collapse of all the curves on a unique distribution: this is clear in the figure 
8 (the tail spread is consistent with statistical fluctuations) and in the first part 
of all the plotted distributions that contains the great majority of the data. All 
the distributions show a big contribution from the short times activities and a 
fast decaying tail for large {t/ < t/^ >) > 2. There is a smooth rise of a "signal" 
as k increases denoted by the appearance of two peaks at t/ < t >k— 1,3: this 
is clear in the last figure 10. Therefore the empirical observation gives a strong 
indication for the existence of an universal distribution f{u) for the normalized 
activity downtime, even if when we consider high degree activities (fc > 10) some 
new features appear but with a small statistical weight. A possible interpolation 
of the distribution /(u) is given by 

fiu) cx ie-"" (8) 
u 

imsart-generic ver. 2009/08/13 file: stat_law.tex date: December 22, 2009 



A. Bazzani et al. /Statistical Laws in Urban Mobility 11 




Normalized activity time 

Fig 8. Empirical distributions of the k-degree activity downtime as a function of the normal- 
ized downtime t/ < t >fe. The different symbols refer to the different activity degrees: k = 3 
(circles), k = 4 (squares), k = 5 (rhumbs), k = 6 (up triangles), k = 7 (down triangles). 




Normalized activity time 



Fig 9. Empirical distributions of the k-degree activity downtime as a function of the normal- 
ized downtime t/ < t >j.. The different symbols refer to the different activity degrees: k = 8 
(circles), k = 9 (squares), k = 10 (rhumbs), fc = 11 (up triangles), k = 12 (down triangles).. 
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Fig 10. Empirical distributions of the k-degree activity downtime as a function of the normal- 
ized downtime t/ < t >fc. The different symbols refer to the different activity degrees: fc = 16 
(circles), fc = 17 (squares), k = 18 (rhumbs), k = 19 (up triangles), k = 20 (down triangles). 

where the coefficient a has a value ~ .4. The distribution (8) is singular at the 
origin so that the interpolation is certainly approximated at it ^ (see fig. 14 
in the appendix). The exponential decay is the typical Boltzmann statistics as 
for the path length distribution, whereas the behavior is consistent with 
the Benford's law for the downtime distribution. So that this "universal distri- 
bution" seems to mix both the features shown in fig. 4 and 5. To explain the 
singular f{u) trend at the origin, we guess that it can be a sign of a strong 
free-will in the individual behavior in the short time activity range. Surely for 
a more precise justification further study are required. We can use the interpo- 
lation (8) to extract the signal from the high degree activity distribution, by 
computing the ratio between the empirical distribution data and the interpola- 
tion (7); in the figure 11 we plot the ratio distribution results. As it can be seen, 
the empirical data define two peaks centered at t ~ 4 hours and at i ~ 10 that 
are common to all the distributions when k = 16, ...,20. Even if these peaks 
are not statistically relevant, they can be related to individuals that perform 
repeated activities linked to the canonical working time schedule. Clearly the 
working time schedule introduces further constrains to the individual mobility 
agenda, that are not taken into account by the universal distribution f{u). Now 
the existence of an universal distribution implies (cfr. eq. (6)) 



Then using the interpolation (7) and performing the change of variable u{k) = 
tj exp(7fc°), we obtain that the Benford's law (5) implies a power law distribu- 
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Fig 11. Ratio between the empirical distributions of the k-degree activity downtime and the 
interpolation (7) as a function of time (h). The different symbols refer to the different activity 
degrees: fc = 16 (circles), fc = 17 (squares), fc = 18 (rhumbs), fe = 19 (up triangles), fc = 20 
(down triangles). Two peaks at a downtime t c:^ A h and at t ^ 10 h that are common to all 
the distributions can be observed. 



tion for the activity degrees(see appendix) 

- ^ (10) 

According to the estimate (7), we expect an exponent ~ —1.7. In fig. 12 we plot 
the empirical activity degree distribution with a numerical interpolation by a 
power law fc"''; the data provide b ~ 1.6 which is consistent with the analytical 
estimate (10). 



6. Conclusions 

The citizens mobility is an interesting social phenomenon that involves a large 
number of " intelligent" elementary components with a free will individual prop- 
erty, so that we can speak of a cognitive dynamics. In this paper we analyze a 
lot of car mobility data for the Florence area, showing the emergence of three 
robust statistical laws for the path lengths, the activity downtime and degree. 
These laws can be explained as the direct consequence of some simple hypothe- 
ses as the particle independence, and above all assuming the existence of an 
individual mobility energy and of a finite individual time spent for the desired 
urban activities. Moreover, from the downtime distribution we deduce an uni- 
versal function which fits with empirical observations and data. We think that 
this universal function can be interpreted as an indication of the cognitive time 
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Activity Degree 

Fig 12. Empirical distributions of the activity degree (circles). In a log-log scale we enlighten 
the interpolation with a power law with b ~ 1.6 (continuous curve). 

perception common to all human beings, or at least surely common to the car 
drivers. Finally the urban mobility is clearly complex, but the our steady state 
statistics cannot point out the typical complexity signatures as, for instance, 
self-organized states. To detect complexity in mobility dynamics, it is necessary 
to investigate the transients, i.e. the states far from equilibrium. 
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8. Appendix 

Let us consider k stochastic variables uniformly distributed in the unit segment, 
the probability that a segment of length < x is empty can be estimated according 

V{< x) = l- (l-x)'' 

As a consequence the probability density that a certain segment x is empty is 
given by 

p{x) = ^ = k{l - x)''-^ 

Therefore if one choices randomly an integer number k in the interval [1,-/V], 
the probability density for a segment of length x conditioned by the choice k is 

Pk{x) oc {k + l)k{l - x)''-'^ xe[0,l] (11) 

since we have to take into account k + 1 possible segments. The probability (11) 
has to be weighted by the probability p{k) oc a*^ to have k points so that the 
probability density to detect a segment of length x for any choice k is 

(-1 ^2 ^ 
= (2 - .).(1 - an Na^^Hl - a) g/^ + " ^^"'^ ^''^ 
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where we have introduced a normahzing factor. In fig. 13 we show the com- 
parison between the equation (12) and a MonteCarlo distribution with N — 7. 
The distribution (12) allows an analytical approach to the single trip length 




Fig 13. MonteCarlo simulations for the probability density of intervals in a given segment 
defined by randomly choosing n points in the interval uniformly distributed with n a random 
integer < N = 7. The continuous line is the plot of the theoretical distribution (12). 



distribution (cfr. fig. 4 in the paper). 

The empirical conditional distributions p(t \ k) for different degrees fc as a func- 
tion of the normalized activity downtime t/ < t >k suggests that the statistically 
relevant can be described according to 

ri, I = ISt^liil (13) 
where we introduce an universal fimction /(t/ < t >k) which can be interpolated 

by 

f{u) cx ^e-"" (14) 

The results are shown in figure 14 for the activity degrees = 3, ..,20. 

There is a strict relation between the activity degree distribution (see fig. 12 
in the paper) and the existence of an universal distribution probability f{u) in 
eq. (13) Indeed taking advantage from the dependence of < t >k on the degree 
k pointed out by experimental observations (see fig. 7 in the paper) we perform 
the change of variables 

I' ^^ (15) 

\u =t/<t>k 
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Normalized activity ti 



Fig 14. Empirical distributions for the conditional probabilities p{t \ k) for activities degrees 
A: = 3, .., 20 as a function of the normalized activity downtime t/ < t >k (different symbols). 
The continuous line refers to an interpolation with the function (4)- 



in the join probability distribution p(k, t) of degree and downtime (cfr. cq. (9)). 
Using the definition (5), we get the new distribution 

p'{u,t) = f{u)k{u)p{k{u))f = -f{u)kp{k)^^ f^^y^' 

au t \ dk 

where k has to be read k(u) in the r.h.s. and p{k) is the activity degree distribu- 
tion. In the previous formula we approximate interpolate the discrete variable k 
with a continuous variable. By integrating of u we have to recover the Benford's 
law oc 1/i for the global activity downtime distribution (see fig. 5 in the paper). 
Since f{u) is normalized as probability distribution, this is possible if 

kp{k) < t >k ^|r^^ = const. (16) 

According to the interpolation < t >fc(x exp7fc" of the experimental data as 
shown in the figure 7 in the paper, we explicitly have 

^^1^ oc r-ie^''^" oc < t >k 

dk 

therefore the condition (16) reads 

k'^~'^p{k) = const. 

i.e. a power law distribution of the activity degree with exponent < 2. This is 
consistent with the experimental observations as shown by the figure 12 in the 
paper. 
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